
Fix nit in LoRA doc #1054

Open · wants to merge 1 commit into main
Conversation

awni
Member

@awni awni commented Oct 16, 2024

Very tiny fix, closes #1053

@madroidmaq
Contributor

madroidmaq commented Oct 17, 2024

Hi @awni, according to OpenAI's official documentation, this may not be a bug: `arguments` is not an object but a string (a serialized "object") that needs to be deserialized into a real object.

For details, see: https://platform.openai.com/docs/guides/fine-tuning/fine-tuning-examples

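For reference, a minimal sketch of what the OpenAI-style message looks like (the `get_current_weather` function is an illustrative placeholder, not taken from this thread):

```python
import json

# OpenAI-style assistant message: "arguments" is a JSON *string*,
# not a nested object.
message = {
    "role": "assistant",
    "tool_calls": [
        {
            "type": "function",
            "function": {
                "name": "get_current_weather",
                "arguments": "{\"location\": \"Boston\", \"unit\": \"celsius\"}",
            },
        }
    ],
}

# The string must be deserialized before the values can be used.
args = json.loads(message["tool_calls"][0]["function"]["arguments"])
print(args["location"])  # Boston
```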

@hansvdam

hansvdam commented Oct 17, 2024

Yes, it is true that it should be like that according to the OpenAI docs, but MLX does not fine-tune well if you adhere to it. I guess something goes wrong under the hood with the interpretation of training data in that format. I used it for fine-tuning meta-llama/Llama-3.1-8B-Instruct.

@madroidmaq
Contributor

If you check the chat_template configuration in the meta-llama/Llama-3.1-8B-Instruct repository, you may find that the reason is the required function-call format, which is inconsistent with the OpenAI format. Llama-3.1-8B-Instruct requires a dictionary to be returned.

You have access to the following functions. To call a function, please respond with JSON for a function call.
'Respond in the format {"name": function name, "parameters": dictionary of argument name and its value}.'

So if you return according to the document's format, some issues may arise (the format in the base model is inconsistent with the fine-tuning data format). When I reviewed the HuggingFace chat_template document again, I found it also highlighted this part:

If you’re familiar with the OpenAI API, you should pay attention to an important difference here - the tool_call is a dict, but in the OpenAI API it’s a JSON string. Passing a string may cause errors or strange model behaviour!
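The difference the Hugging Face docs call out can be illustrated with a small sketch (the weather function and its arguments are made-up examples, not from this thread):

```python
import json

# Hugging Face chat-template style: "arguments" is a plain dict...
hf_tool_call = {
    "name": "get_current_weather",
    "arguments": {"location": "Boston", "unit": "celsius"},
}

# ...whereas the OpenAI API serializes the same arguments as a JSON string.
openai_tool_call = {
    "name": "get_current_weather",
    "arguments": json.dumps({"location": "Boston", "unit": "celsius"}),
}

assert isinstance(hf_tool_call["arguments"], dict)
assert isinstance(openai_tool_call["arguments"], str)
# Deserializing the string recovers the same dict.
assert json.loads(openai_tool_call["arguments"]) == hf_tool_call["arguments"]
```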

When I checked the mistral-finetune project again, I found that its data usage is consistent with the OpenAI format.

So I think there might not be a single strictly correct format here. The key point is that the format of your fine-tuning dataset must be consistent with the base model's chat-template format; otherwise problems will arise. I think this part could be explained in the documentation.
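One way to keep a dataset consistent with a chat template that expects dicts is to deserialize any string-valued `arguments` before training. A hypothetical helper sketch (`normalize_tool_calls` is not part of mlx-lm or any library mentioned here):

```python
import json

def normalize_tool_calls(messages):
    """Convert OpenAI-style string 'arguments' into dicts in place, so the
    fine-tuning data matches a chat template that expects dictionaries."""
    for msg in messages:
        for call in msg.get("tool_calls", []):
            # OpenAI nests the call under "function"; fall back to the
            # call itself for flatter layouts.
            fn = call.get("function", call)
            if isinstance(fn.get("arguments"), str):
                fn["arguments"] = json.loads(fn["arguments"])
    return messages

data = [
    {
        "role": "assistant",
        "tool_calls": [
            {"function": {"name": "f", "arguments": "{\"x\": 1}"}}
        ],
    }
]
normalize_tool_calls(data)
print(data[0]["tool_calls"][0]["function"]["arguments"])  # {'x': 1}
```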

Successfully merging this pull request may close these issues: tools example for finetuning seems wrong